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Abstract 

In this paper we consider a transformation which converts uncorrelated networks to correlated 
ones( here by correlation we mean that coordination numbers of two neighbors are not inde- 
pendent). We show that this transformation, which converts edges to nodes and connects them 
according to a deterministic rule, nearly preserves the degree distribution of the network and signif- 
icantly increases the clustering coefficient. This transformation also enables us to relate percolation 
properties of the two networks. 
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I. INTRODUCTION 



One of the oldest and best studied models of networks which have the merit of being 
exactly solvable for many of their properties, are the random graphs of Erdos and Renyi 
|1|, 0]. These graphs consist of nodes any two of which are connected with a probability 
p and left unconnected with a probability 1 — p. Many of the properties of these networks 
can be easily derived by analytical means. Among them are the degree distribution of 
nodes, which turns out to be Poissonian, the average shortest path between any two nodes 
which is of the order of log(A^), and the onset of phase transition for developing a giant 
component which happens as p exceeds a certain critical value. 

Despite their exact solvability and their low diameter, these networks lack some of the 
other crucial properties of real life networks. In particular it is well known that many 
real networks e.g. World Wide Web, social networks, power grids, scientific and artistic 
collaborations, neural and metabolic networks, show clustering or transitivity which is 
absent in Erdos and Renyi graphs. Moreover many real networks do not possess Poissonian 
degree distribution and intensive studies have been made to construct models as close as to 
real networks 0, ||, |, ^ and to study dynamic effects, e.g. spreading of a contact effect, on 
them.0,|]. 

In the past few years an elegant theory has been developed to construct random graphs 
with desirable degree distributions to mimic the degree properties of real networks. It 
appears that Bender and Canfield ||^, [1^ have been the first to propose an algorithm for 
constructing a random graph with a specific degree distribution. We will call the ensemble 
of graphs constructed in this way the Bender Canfield ensemble. It is remarkable that these 
graphs are still exactly solvable to a large extent |10|- However these graphs still have two 
shortcomings. First they do not show correlation in the degree of nearest neighbors and 
second their clustering coefficient vanishes in the large N limit. 

It is important that the degree distribution does not determine by itself the existence or 
lack of correlations. For a specific degree distribution one may have or have not correlations. 
Moreover, it has been observed that correlation is an essential feature of real networks 
which can appear in different forms |Tl|. For instance, a high degree node may be connected 
to other high degree nodes(associative mixing), to low degree nodes(dissociative mixing) or 
with equal probability to both types (neutral mixing) with different resulting behaviors in 
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networks H, |T3|, |T|, [15 . 

For this reason algorithms have been developed to produce correlated networks with certain 
degree distribution ||, 0, |T^, |17 . 



In this paper we will suggest a simple deterministic transformation on the Bender Canfield 
(BC) graphs and show that the transformed graphs are both correlated and clustered in the 
large N limit. Given a BC graph G with nodes and a degree distribution P{k), we 
construct a graph G by assigning nodes to each edge of G. We then connect these new 
nodes if the corresponding edges in G have had a common node in G. We show that 
many of the properties of these transformed networks can be obtained exactly or almost 
exactly. We obtain general formulas for the degree distribution and its correlations for the 
transformed graphs and will obtain also formulas for the clustering coefficients of these new 
graphs. As examples we apply our transformation to Bender Canfield graphs with various 
degree distributions. 

It has been shown by Newman that percolation on BC graphs can be solved by a generating 
function method. The method is applicable due to the fact that in these graphs there is no 
clustering. By applying our transformation to these graphs we can follow similar steps and 
solve percolation on G . The interesting point is that now G is a highly clustered graph for 
which we can solve percolation. 

The paper is structured as follows: In section |T| we give a brief review of Bender Canfield 
ensemble of random graphs having arbitrary degree distributions. In section |ITI| we discuss 
our transformation and derive various properties of general transformed graphs. In section 



rVl we apply our general formulas to graphs with degree distributions of Poissonian, scale 
free and exponential types. We end the paper with a conclusion. 



II. BENDER-CANFIELD ENSEMBLE OF GRAPHS 

Let G denote a graph with A^ edges and L links. Let also the degree distribution of this 
graph be given by the function P{k), that is the fraction of nodes with k neighbors be given 
by P{k). There is an algorithm ^ for constructing graphs whose degree distribution 
corresponds to P{k) for large A^. More specifically given a degree sequence {ki, ^2, ■ " " ^a^) 
corresponding to the desired degree distribution P{k), one takes each node i with ki loose 
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ends (stubs) and then connects each pair of stubs randomly until no loose end remains. 
We call these types of graphs Bender-Canfield or simply BC graphs. Thus we speak of 
Poissonian BC graphs or scale free BC graphs to designate the degree distribution used for 
their construction. 

Many of the properties of BC graphs can be calculated exactly. For example the average 
number of first neighbors of an arbitrary node, denoted by zi is given by 

z,:={k) = J2kP{k). (1) 

k 

It is useful to call a node with k emanating edges a node of type k. Then the probability of 
picking up a node of type k is given by P{k). We can now ask a different question: What 
is the probability q{k) of picking up an edge which belongs to a node of type k + 1. This is 
equal to the fraction of stubs coming out of nodes of type k + 1: 

_ (k + l)Pik + l) _ (k + l)Pik + l) 

If we now follow a link to one of its ends the average number of new links ( the average ratio 
of the number of second to the first neighbors of an arbitrary node ^) will be given by 

As we will see this quantity will play a central role in many of the later derivations. 

We can ask yet another question. What is the probability P{k, k') of picking up an edge 
which is common to a node of type + 1 and a node of type k' + 1?. This probability is 
given by a product which is a reflection of the absence of correlations in these networks, 

P{k,k') =q{k)q{k'). (4) 



One can also calculate the clustering coefficient of these graphs. The result is [10 



z, {k')-{k) ijz^r , . 

{kY ' n(^i)3" 

It is important to note that for many kinds of degree distributions (i.e. those with finite Z2 
and Zl), this clustering coefficient vanishes in the limit of large graph size A^. It can also 
be shown that the ratio — controls the existence of an infinite cluster of connected nodes 

21 
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\LU^ [T8|] for these graphs. For > 1 there is an infinite cluster where the average distance 



between two arbitrary nodes, is of order log{n). Recently it has been found that almost all 
pairs of nodes have the same distance in this cluster . 



It will be convenient to define two generating functions for the BC ensemble corresponding 
to a degree distribution: 

Go(t) := j:t'm G^{t) ■.= Y.t'qik) = -G',{t) (6) 

k A: ^1 

where G'{t) = ^G{t) and in the last relation we have used (0). In terms of these generating 
functions, the average number of first neighbors zi and second neighbors Z2 are simply given 
by 

zi = jGo{t)\t=i Z2 = z,jG,{t)\t=i = ^[,'(1). (7) 

III. TRANSFORMATION OF BC GRAPHS 

Consider a BC graph G with nodes with a degree sequence (fci, ^2, ■ ■ ■ ^at) taken from 
a distribution P{k). We can transform this graph to a new graph G as follows. To each 
link of the original graph, we assign a node of the new graph. We connect any two nodes of 
the new graph if the corresponding links in the original graph have a node in common, see 
fig.(|l|). The number of nodes and links of G denoted respectively by and L respectively 
are determined from the degree distribution of G. We note that each node of type k of G 
contributes k nodes and k{k — l)/2 edges to G. Taking care of the fact that the new nodes 
are counted twice we find: 



N = lj:h (8) 

^ i=l 

1 ^ 

L = - Y.h{h-l) 
^ i=i 

As figure (@-a) shows the degree distribution of the new graph G is given by: 

P{k)= (lirHs), (9) 

r+s=k 

which is nothing but the probability that an arbitrary edge in G is connected to a total of 
k edges at its two end point nodes. 

There are simple relations between the generating functions of G and G. Using (|^) we find: 



FIG. 1: The basic transformation. The fihed circle and solid lines belong to G and empty circles 
and dashed lines belong to G. 



Go{t) = E t'P{k) = E E = E t''"l{r)q{s) = (^)^ (10) 

k k r+s=k r,s ^1 

From this last equation we find 



In view of (E) we have 



Gi{t) := -^Go(t) = (12) 



However since the graph G is a clustered graph as we will see, the ratio of average number 
of second to first neighbors of an arbitrary node, |?- is not given by the expression kq{k) 
as in (^. Instead we resort to a direct counting. As shown in fig. (0-b) if we follow a 
node of G to the right, the number of first neighbors that we find is given by — inherited 
from G. Due to the low clustering of G, the number of second neighbors that we will meet 
will be (f^)^. Thus the total number of second neighbors will be twice this value, that is 
Zo = 2{—Y. The above reasoning indeed shows that 4?- = ^ which in turn means that the 




FIG. 2: a) First neighbors of a node in G. b) Second neighbors of a node in G{ reachable from one 
of its sides). 



conditions |10| for the development of a giant component in G andG are identical. 

We now find the probability k') of finding an edge the end nodes of which have k and 

k' other neighbors. 

Looking at figure (^) , the probability of finding an edge like AB in G is equivalent to finding 
two edges ac and he incident on the same node c in G. For a moment suppose that no other 
edge in G is incident on c. Then it is clear from figure (^) that the nodes A and B will have 
+ 1 and k' + 1 neighbors in G if the nodes a and h will have + 1 and k' + 1 neighbors in G 
respectively. Putting this together we find that in this simple case k') = P{2)q{k)q{k'), 
where P{2) comes from the probability of finding a node like c of degree 2 in G. In general 
the node c may be common to t other edges in G. These extra edges contribute to the total 
number of neighbors of A and B, so that in order for the node A to have a total of A; + 1 
neighbors in G, the node a needs only have k + l — t neighbors in G. A similar statement is 
true also for the node B. Thus instead of the factor q{k)q{k') we will have q{k — t)q{k' — t). 
This should be multiplied by the probability of finding a triplet acb which is proportional 
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c 




FIG. 3: The degrees of the nodes A and S in G are correlated by the vertex c in G. 
to (*+^K*+^) p(/: _|_ 2) and finally summed over t. The final result is 

P{k, k') = ^^±^l^^P(i + 2)q{k - t)q{k' - t), (13) 

t ^2 

In this way correlations are introduced into the graph in the sense that P{k, k') is no longer 
equal to q{k)q{k'). 

It is also possible to calculate the clustering coefficient of G. It is clear that an edge with end 
point nodes of degree r+1 and s+1 in G represents a node of degree r+s in G. Thus the total 
number of potential connections among these first neig hbors is Of these possible 

connections , there are already a number of lil^iii + ^^^^ connections present coming from 
the definition of G. Due to the clustering coefficient of G, there are configurations which 
increase this number for finite graphs. However since in the thermodynamic limit we know 
that the BC graphs have vanishing clustering coefficients we need not worry about these 
contributions. Thus in the thermodynamic limit we have the following formula for clustering 
coefficient of G: 
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r(r - 1) + s(s - 1) , , , . 
C = > -i ^ 7^qir)qis). (14) 

In this way our transformation has introduced a finite clustering coefficient into the BC 
ensemble of graphs. In the next section we will apply this transformation to several well 
known ensembles with specific degree distributions, namely the Poisson, scale free and 
exponential ensembles. 

Finally let us consider percolation on G. For the sake of simplicity, here we consider 
only site percolation but the same analysis can be applied to bond percolation as well. Let 
each node of G be occupied with probability p and denote the probability that an arbitrary 
node belongs to a cluster of size n, by P„. The generating function of this probability is 
denoted by H{x), that is H{x) = J2^=oPnX"'. Using the same procedure as in |jl9l we write 
the following expression for H{x) : 

H{x) = l- p + pxh'^{x), (15) 

where h{x) is the generating function for the number of nodes reachable if we follow the 
neighbors of the node in one of its sides say to the right (i.e. if we follow the corresponding 
link on G to the right) (see fig. (0-b)). The expression for h{x) is obtained recursively as 

oo ^ k 

h{x) = E^('^)E( )fil-p)''~"h'"{x)x' 

k=0 r=0 r 

= Gi{xph{x) + l-p) (16) 
Solution of these two equations will give us the probabilities -P„. 

IV. EXAMPLES 
A. Poisson graphs 

For a poissonian distribution, where P{k) = ^e"^, it is readily verified using () that 
q{k) = p{k). We find from (|]) that 

P(k) = Y. P(r)P(k - r) = = i^e"- (17) 
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FIG. 4: a) plots of D{k) for several values of A(here kmax = 1000) and b) Clustering coefficient. 



where in the last step we have used the binomial distribution. Thus our transformation 
maps a poissonian graph to another poissonian graph, whose average degree is twice the 
original one. However this new graph is completely different from the original one in other 
respects. First there is correlations between the degree of neighbors and second it has a 
finite clustering coefficient even for large graphs (as — > oo). To see this we use (|13|) 
to calculate for the specific case of k = k', the difference D{k) := P{k,k) — q{k)q{k) as a 
function of k for various values of A. The result is shown in fig. (^a). Figure (^b) shows 
the clustering coefficient (calculated from (p!4D ) as a function of A. It is clearly seen in this 
and the other cases considered below that there are non- vanishing correlations in the degree 
distributions. Also these transformed graphs have appreciable value of clustering. Moreover 
it is seen that the clustering coefficient approaches a maximum value of nearly 0.5 for large 
value of average connectivity. The reason is that, in this limit an appreciable fraction of 
the nodes of G have a high degree approximately equal to zi, and thus one can estimate C 
from (|1^ as C ~ 2zl{2z^-i) ^ This explanation applies to the other examples discussed 
below. 
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B. Scale free graphs 



For scale free graphs we have P{k) = Tj-rk ^ where ^(7) = J2T=i ^ ^- From (|l],|^) we find 



Zi = and q{k) = ^^^zjf'- We find from 



Pik) = + 1)(A: + 1 - s)Y-^ (18) 



C(7)' 

It is seen that for A; ^ 1 the above sum is dominated by its first and last terms. Thus for large 
/c, P{k) behaves like k^~"' which in turn gives 7 = 7 — 1. Thus the transformation maintains 
the power law behavior of degree distribution for large degrees. To see this behavior more 
precisely, we go to the continuum limit and convert the above sum into an integral which 
after a little rearrangement can be cast into the form: 

p{k) = (7 - 2f dx[{^- + 1)^ - (x - ^-rr- (19) 

A change of variable a; — | = (| + 1) sin 6' turns this integral into the form 

P(A;) = 2(7-2)2(^ + 1)3-27 rcosO^-^^de (20) 
2 JO 

where sin a = -^^^ . As an example for the case 7 = 5/2 we find 

3 

which as expected, behaves like k~2 for large k. Like the previous example, we calculated 
numerically D{k) and C. The results have been shown in figure (P). 



C. Exponential graphs 

k 1_ 

Finally, let us consider exponential distributions, P{k) = Ae '^o where A = 1 — e '^o is a 
normalizing factor. We find from (Q) q(k) = j^i^ + l)e '"o . Using (|^) we find 



^4 k 



k + 2 



= n — J^^'~ + + 1 - s). (22) 

U ~ ^) s=0 



Converting the sum to integral, we find: 



^^^^ = + + ^^^^ 
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FIG. 5: a) D{k) and b) C for scale free graphs. 




FIG. 6: a)D{k) and b)C for exponential degree distribution. 

We see that transformation does not change the cutoff value, /cq; but produces some 
polynomial terms. 

In figure we display the correlation and the clustering coefficient for this type of degree 
distribution, using equations (0) and ([M]). 
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V. CONCLUSION 



We have introduced a transformation which when apphed to an uncorrelated network 
with low clustering, produces a new correlated network with a considerable clustering. The 
small world property of the graph is not affected by this transformation, since the shortest 
path on two nodes of G is almost the same as the shortest path on two incident links on 
these nodes on G or two nodes on G. We thus have a method to produce ensemble of graphs 
which resemble more closely the real networks while still being solvable in many respects. 
Moreover it will be possible to use this transformation and solve dynamic processes on 
these new graphs. For example we have shown how site percolation can be solved on these 
transformed graphs. 

This transformation can also be applied to already correlated networks. Moreover we 
have considered only the deterministic form of the transformation . In some cases it may 
be useful to introduce a stochastic parameter in the transformation to have more degree of 
freedom. That is the nodes of the transformed graph G, can be connected with a probability 
p if the corresponding edges in G have a common node. 
We hope to investigate these issues in subsequent works. 
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